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The 3C-lil<e proteinase (3CL pro ) of the severe acute respiratory 
syndrome (SARS) coronavirus plays a vital role in virus maturation 
and is proposed to be a key target for drug design against SARS. 
Various in vitro studies revealed that only the dimer of the matured 
3CL pro is active. However, as the internally encoded 3CL pro gets 
matured from the replicase polyprotein by autolytic cleavage at 
both the N-terminal and the C-terminal flanking sites, it is unclear 
whether the polyprotein also needs to dimerize first for its auto- 
cleavage reaction. We constructed a large protein containing the 
cyan fluorescent protein (C), the N-terminal flanking substrate 
peptide of SARS 3CL pro (XX), SARS 3CL pro (3CLP), and the yellow 
fluorescent protein (Y) to study the autoprocessing of 3CL pro using 
fluorescence resonance energy transfer. In contrast to the matured 
3CL pro , the polyprotein, as well as the one-step digested product, 
3CLP-Y-His, were shown to be monomeric in gel filtration and ana¬ 
lytic ultracentrifuge analysis. However, dimers can still be induced 
and detected when incubating these large proteins with a substrate 
analog compound in both chemical cross-linking experiments and 
analytic ultracentrifuge analysis. We also measured enzyme activ¬ 
ity under different enzyme concentrations and found a clear tend¬ 
ency of substrate-induced dimer formation. Based on these discov¬ 
eries, we conclude that substrate-induced dimerization is essential 
for the activity of SARS-3CL pro in the polyprotein, and a modified 
model for the 3CL pro maturation process was proposed. As many 
viral proteases undergo a similar maturation process, this model 
might be generally applicable. 


Shortly after its outbreak in 2003, severe acute respiratory 
syndrome (SARS) * 1 2 was confirmed to be caused by a new type of 
coronavirus, SARS-CoV. Similar to other coronaviruses, two- 
thirds of its genome encodes two large replicase polyproteins, 
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pp la (450 kDa) and pp lab (750 kDa), which will undergo exten¬ 
sive proteolytic processing mainly by the internally encoded 
main proteinase (also called 3C-like proteinase, 3CL pro ) to pro¬ 
duce multiple functional subunits that mediate both genome 
replication and transcription (1). 

The crystal structure of SARS wild type 3CL pro is a 
homodimer (Protein Data Bank codes 1Q2W, 1UJ1, and 1UI<2) 
and topologically similar to the other coronaviruses such as 
transmissible gastroenteritis coronavirus and human coronavi¬ 
rus 229E (2, 3) (Protein Data Bank codes 1LVO and 1P9U). One 
protomer of 3CL pro consists of three domains, of which the first 
two form a chymotrypsin fold, and is connected by a long loop 
with the third extra helix domain. The catalytic dyad, His-41 
and Cys-145, locates in the deep cleft between domain I and II. 
We have shown that the dimer is the biologically functional 
form of matured 3CL pro (4), and only one monomer is active (5). 
Certain single site mutations, such as M6A, G11A, S39A, and 
R298A, result in inactive monomers (6-9). 

Dimerization is a commonly used strategy in viral protease 
activity regulation (10). Many of the reported viral proteases are 
active only in dimer form (4,11-14). For enzymes with an active 
site formed by residues from both the protomers, like HIV-1 
protease (15), it is straightforward to understand why the dimer 
formation is necessary. However, for proteases with a complete 
active site in one protomer, dimerization may be one method to 
regulate its activity. It is interesting to know what happens for 
these proteases before they have been cleaved out from the 
polyprotein as it might be difficult for a large flexible polypro¬ 
tein to form a stable dimer. A recent study revealed that the 
mini-precursor of HIV-1 protease formed a highly transient but 
low populated dimeric structure during maturation (16). 

For SARS 3CL pro , much has been learned about its catalytic 
mechanism (17), substrate specificity (18), as well as inhibitor 
design (19, 20). However, few studies have been reported on its 
maturation mechanism. Shan et al (21) introduced a 31-mer 
peptide containing an autocleavage site flanking to the N ter¬ 
minus of 3CL pro to test the in cis activity of SARS 3CL pro . They 
found that the peptide can be autocleaved efficiently by 3CL pro 
itself by monitoring the autocleavage products on the gel. Hsu 
et al. (22) reported that the SARS 3CL pro can be matured from 
polyprotein with flanking N- and C-terminal segments in vitro . 
The N terminus is digested faster than the C terminus, and 
during digestion, the 3CL pro with 10 residues attached to both 
the N terminus and the C terminus can form dimer. However, 
10 residues flanking at the N terminus and C terminus might be 
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too short and may not reflect the real situation in the polypro¬ 
tein. To understand the maturation and activity regulation 
mechanism of SARS 3CL pro , we constructed an artificial 
polyprotein containing cyan fluorescent protein (C), the 
N-terminal natural flanking substrate peptide of SARS 3CL pro , 
SITSAVLQ (XX), SARS 3CL pro (3CLP), and yellow fluorescent 
protein (Y) and used it to study the autoprocessing mechanism 
of SARS 3CL pro polyprotein. In contrast to the dimerization of 
the matured enzyme, this polyprotein and its N-terminal 
cleaved product were found to be monomeric in conventional 
analysis but are still active. 

EXPERIMENTAL PROCEDURES 

Plasmid Construction of the Large Fusion Proteins His-C- 
XX(Q/E)-3CLP-Y, His-C-XX-Cl45A-Y, 3CLP-Y-His, and 3CLP- 
MBP-His —The large fusion protein constructs basically con¬ 
tained four components in sequence. The first component is C, 
and the second component is XX, which is the natural auto¬ 
cleavage substrate peptide of SARS-CoV 3CL pro derived from 
the natural ppla/pplab polyprotein attached to its N terminus. 
Sequence analysis revealed 11 cleavage sites of SARS 3CL pro on 
the SARS polyproteins, all of which contain a highly conserved 
substrate sequence (PI Gin i PI' Ser/Ala) (23). The N-terminal 
autolytic substrate peptide (SITSAVLQ i SGF) has the highest 
cleavage efficiency among all 11 substrate peptides (4). The 
third component consists of 3CLP or mutants, and the 
fourth component is Y, so the construct is named 
C-XX-3CLP-Y. 

To prepare His-C-XX(Q/E)-3CLP-Yand His-C-XX-C145A-Y, 
a cloning vector pET28a-CY2.0 with His 6 tag at the N terminus 
was built first (supplemental Fig. SI). Detailed experimental 
procedures can be found in the supplemental material. Here, 
Q/E represents the mutation of Gin in the extra 8-amino acid 
peptide (SITSAVLQ) to Glu. Mutation of the critical residue 
Gin at the PI position to Glu completely abolishes the hydro¬ 
lytic activity of SARS 3CL pro (18). 

3CLP-Y-His is the truncated protein of His-C-XX(Q/E)- 
3CLP-Y, which also had a His 6 tag at the C terminus for further 
protein purification (supplemental Fig. SI). 3CLP-MBP-His 
was a plasmid in which the YFP of 3CLP-Y-His was replaced by 
maltose-binding protein (MBP) (supplemental Fig. SI). 

All the plasmids were constructed following the methods 
listed in the supplemental material, and primers needed were 
listed in supplemental Table SI. All constructs were confirmed 
by DNA sequencing (Invitrogen). 

Fusion Protein Expression and Purification —The large poly¬ 
proteins with His 6 tags were expressed and purified with a 
slightly modified procedure when compared with the matured 
enzyme (4) (see detailed information in supplemental material). 
The method for wild type SARS-CoV 3CL proteinase expres¬ 
sion and purification was the same as reported before (17). 

Analytic Gel Filtration Analysis —The aggregation state of 
the large fusion proteins, His-C-XX(Q/E)-3CLP-Y and His- 
C-XX-C145A-Y, and the truncated one, 3CLP-Y-His, were 
analyzed using a Superdex 200 HR 10/300 GL column (GE 
Healthcare) on AKTA fast protein liquid chromatography. 
The purified proteins, at different concentrations, were 
loaded on the column, which was pre-equilibrated to 36 ml 



of buffer A (40 mM phosphate-buffered saline, pH 7.3, 100 
mM NaCl, 1 mM EDTA). The column was eluted with another 
36 ml of buffer A at flow rate of 0.5 ml/min, during which the 
eluted volume of the fraction peak at 280 nm was monitored 
on fast protein liquid chromatography. The concentrations 
loaded onto the column were: 1) His-C-XX(Q/E)-3CLP-Y, 

4.4 and 10 mg/ml; 2) His-C-XX-C145A-Y, 4.4 and 10 mg/ml; 
and 3) 3CLP-Y-His, 6 mg/ml. Gel filtration molecular weight 
markers used and the standard calibration curve can be 
found in supplemental Table S2 and supplemental Fig. S3. 

Analytic Ultracentrifugation (AUC) Analysis —Sedimenta¬ 
tion velocity experiments were carried out using a Beckman 
Optima XLA analytical ultracentrifuge. The procedure of sed¬ 
imentation velocity was the same as reported previously (6). 
His-C-XX(Q/E)-3CLP- Y, His-C-XX-C145A-Y, and 3CLP-Y- 
His were prepared in buffer A. Samples (1 mg/ml, 380 pi) and 
reference (400 /xl) were loaded into double-sector centerpieces. 
Data were analyzed with Sedfit version 11.71. 

In Vitro Trans-cleavage Activity Assay —7.5 /xm His-C-XX(Q/ 
E)-3CLP-Y(or 3CLP-Y-His or 3CLP-MBP-His) as the enzyme 
and 22.5 pM His-C-XX-C145A-Y as the substrate were mixed 
and incubated in buffer A with 5 mM dithiothreitol at 37 °C with 
500 rpm of shaking for 2 h. The substrate His-C-XX-C145A-Y 
was cleaved at the substrate peptide Q i S bond to release two 
fragments, which eliminated FRET and resulted in a decrease at 
527 nm. To confirm the molecular weight of the two products, 
sample aliquots of 10 jixl were taken out at different reaction 
times. For enzyme His-C-XX(Q/E)-3CLP-Y, samples were 
taken out at 0, 30, 60, 90, and 120 min and put on ice. In addi¬ 
tion, for 3CLP-Y-His, samples were taken out at 1,2, 3,4, 5,6, 8, 
10, 15, 20, and 30 min. One part of the reaction mixture (5 jixl) 
was diluted 40 times into ice-cold buffer A for FRET, and the 
other 5 pi was mixed with 5 jixl of 2X SDS-loading buffer for 
SDS-PAGE on 10% polyacrylamide gel. Control assays were 
also performed by incubating 7.5 pM His-C-XX(Q/E)-3CLP-Y 
(or 3CLP-Y-His) and 22.5 pM His-C-XX-C145A-Yalone at the 
same conditions as the first test described above. 

Kinetic Measurement of Enzyme Activity —A colorimetric 
substrate, Thr-Ser-Ala-Val-Leu-Gln-pNA (GL Biochemistry 
Ltd.), was used for enzyme concentration-dependent kinetic 
measurement (17). The substrate was cleaved at the Gln-pNA 
bond to release free pNA, resulting in an increase of absorbance 
at 390 nm (measured using Synergy™4, BIOTEK). Enzyme 
concentration ranges used here were 2.25-45 pM for His-C- 
XX(Q/E)-3CLP-Y, 1.36-21.78 pM for 3CLP-Y-His, and 0.225- 

4.5 pM for SARS-3CL pro in a 100- pi volume, respectively. The 
enzyme concentration dependence of rate constants was 
measured at 37 °C in buffer A with 5 mM dithiothreitol. The 
apparent second-order rate constant /c cat //<T m was calculated 
by dividing the pseudo-first-order constant by the enzyme 
concentration (17, 24). We have derived a fitting model to 
obtain intrinsic /c cat /X m (24). To be brief, the apparent 
enzyme activity was contributed from the dimers as well as 
the monomers in solution. 


(^cal/^m)app (^cat/^m)mono^-mono/^-F ~F (^cat/^m)dimer^-dimer/^-f 


(Eq.1) 
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in which (/c cat //<T m ) app is the apparent second-order rate con- 
stant, ik c JKJ mono and (k c JI< m ) dimer are that of the monomer 
and dimers, and C E is the total enzyme concentration. C mono 
and C dimer are the concentration of monomer and dimer, 
respectively. 

As the monomers can associate into dimers, we get 

M + M = M 2 K a = [M 2 ]/[M] 2 (Eq. 2) 

C £ =[M] + 2[M 2 ] (Eq. 3) 

thus 

Cmono = (“1 + 0 + 8/C fl C f ) 1/2 )/4/C 0 (Eq. 4) 

Cdimer = (4/f fl C f + 1 - (1 + 8K a C E ) y2 )/8K a (Eq. 5) 

in which M is monomer, M 2 is dimer, and K a is the monomer- 
dimer association constant. 

We have tried to fit Equation 1 and found that the activities of 
monomers were negligible for the matured 3CL pro , as well as 
the polyproteins. This confirmed again that dimer is the active 
form. We then omitted the contribution from the monomers 
and fitted the data using Origin 8.0 (OriginLab) according to 
the following equation (24). 

(k cat /K m ) app = (/C C at/^m)dimer(4/f 0 C f +1 ~ (1 + 8K a C E ) V2 )/8K a C E 

(Eq. 6) 

The /c cat //C m of dimer and its association constant can be 
obtained from the fitting. 

Inhibition Efficiency of the Isatin Compound Sf on His-Y- 
XX(Q/E)-3CLP-Y—A known inhibitor, the isatin derivative, 
l-(2-naphthlmethyl) isatin-5-carboxamide (5f) (25), was dis¬ 
solved in DMSO to test the inhibition efficiency toward the 
polyprotein. To reduce the high background of FRET, another 
large fusion protein plasmid, His-Y-XX(Q/E)-3CFP-Y, was pre¬ 
pared as above, replacing the first cyan fluorescent protein by 
YFP, which was used as the enzyme for the inhibition test. 7.5 
/xm His-Y-XX(Q/E)-3CFP-Y and 50 /xm 5f (control was 5% 
DMSO) were preincubated in buffer A at 37 °C for 15 min, and 
then 22.5 /xm His-C-XX-C145A- Y (substrate) was added to the 
mixture to react for 2 h. Samples were taken at different reac¬ 
tion times for FRET measurement. 

Chemical Cross-linking —Enzymes (6 mg/ml His-C-XX(Q/ 
E)-3CFP-Y, 6 mg/ml 3CLP-Y-His, and 5.12 mg/ml SARS- 
3CL pro ) alone or mixed with equal molar 5f were cross-linked 
by 10-fold molar excess ethylene glycol bis (succinimidyl succi¬ 
nate) in buffer A at room temperature for 30 min and then 
quenched by adding Tris (1 m, pH 7.5) to a final concentration 
of 50 mM. 

RESULTS 

Aggregation State of the Free Polyproteins —We investigated 
the aggregation states of the fusion proteins using gel filtration 
and analytic ultracentrifugation to see whether they can still 
form dimers in solution. To avoid autocleavage, two mutants, 
His-C-XX(Q/E)-3CFP-Yand His-C-XX-C145A-Y, were used 
for aggregation studies. The Gin to Glu mutation makes the 
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Sedimentation Coefficient (S) 

FIGURE 1. Sedimentation coefficient distribution c(s) profile of His-C- 
XX(Q/E)-3CLP-Y5 His-C-XX-CI45A-V, and 3CLP-YHis at 40,000 rpm and 
20 °C. 

substrate sequence XX uncleavable, whereas the Cys-145 to Ala 
mutation results an inactive enzyme. 

Analytic Gel Filtration Analysis —Both His-C-XX(Q/E)- 
3CFP-Yand His-C-XX-C145A-Y showed high FRET efficiency 
(supplemental Fig. S2), indicating that they were intact. A 
Superdex 200 HR 10/300 GL column was used to estimate their 
apparent molecular weights based on the retention volume 
at different protein concentrations (see details in the sup¬ 
plemental material). Both proteins were shown to be mono¬ 
meric even at high concentration of 10 mg/ml (supplemental 
Table S3 and supplemental Fig. S3). 

Analytic Ultracentrifugation Analysis —To further verify the 
aggregation state of the polyproteins, sedimentation velocity 
experiments were performed. The sedimentation experiments 
provide hydrodynamic information about the molecular size 
distribution and conformational changes in the native solution 
with no dilution effect when compared with gel filtration. Sed¬ 
imentation velocity was used to determine the sedimentation 
coefficient distribution c(s) and the molecular weight distribu¬ 
tion c(M) of polyproteins at 1 mg/ml. The resulting c(s) and 
c(M) distribution profiles showed that only one peak was 
detected (Fig. 1 and supplemental Fig. S4). Sedimentation coef¬ 
ficients of His-C-XX(Q/E)-3CFP-Y and His-C-XX-C145A-Y 
were —4.9 and —5.0 S, respectively. Based on the c(M) distri¬ 
bution model, the molecular masses of His-C-XX(Q/E)- 
3CFP- Y and His-C-XX-C145A-Ywere estimated to be 82.7 and 
86.0 kDa (supplemental Table S4), indicating that both of them 
were monomeric. 

In Vitro Trans-cleavage Activity Assay of 3CL pro in Poly¬ 
protein —The fusion polyproteins containing SARS 3CF pro 
were shown to be monomeric in solution. To test its activity, 
trans- cleavage was monitored in vitro by FRET signal and con¬ 
firmed through SDS-PAGE. 
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FIGURE 2. The enzymatic activity of His-C-XX(Q/E)-3CLP-Y 'in vitro. A, FRET 
spectra at different reaction times. B, SDS-PAGE for the digestion at different 
times. From lanes 1-5, samples from 0-, 30-, 60-, 90-, and 120-min reaction 
times were loaded, respectively. The bands from top to bottom show the 
mixture of His-C-XX(Q/E)-3CLP-Yand His-C-XX-CI 45A-Y, the cleavage product 
I C145A-Y, and the cleavage product II His-C-XX. 


His-C-XX(Q/E)-3CFP-Ywas used as the enzyme, and His-C- 
XX-C145A-Y was used as the substrate. Two control experi¬ 
ments were also carried out by incubating His-C-XX(Q/E)- 
3CFP-Y and His-C-XX-C145A-Y for 120 min, and the 
fluorescence spectra did not change. At the beginning of the 
reaction, a high efficiency FRET signal was detected, which 
decreased along with the digestion (Fig. 2), indicating that 
His-C-XX(Q/E)-3CLP-Y digested the substrate His-C-XX- 
C145A- Ywell. Further supporting evidence was obtained from 
SDS-PAGE. In the beginning, only one band at 91 kDa was 
observed. After incubation for some time, the 91-kDa band 
became weaker. At the same time, two lower molecular mass 
bands appeared and turned thicker, indicating that His-C-XX- 
C145A-Y was digested into two parts (about 27 and 64 kDa). 
The monomeric polyprotein can still retain its enzymatic activ¬ 
ity in vitro . 

Properties of the Truncated Fusion Protein 3CLP-Y-His —The 
results shown above indicate that the 3CL pro in the polyprotein 
appeared to be monomeric and was enzymatically active. We 
then wanted to know the property and activity of the N-termi¬ 
nal cleaved polyprotein. A truncated protein, 3CLP-Y-His, con¬ 
taining the 3CL pro , the YFP protein, and a C-terminal His tag, 
was expressed and purified in vitro . The enzymatic activity was 
tested using His-C-XX-C145A-Y as substrate. 3CLP-Y-His 
digested the substrate His-C-XX-C145A-Yvery rapidly. Within 
15 min, almost 95% of the substrate was digested (Fig. 3), which 
is much faster than the full polyprotein (Fig. 2). Gel filtration 
and analytic ultracentrifugation analysis showed that 3CFP-Y- 
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FIGURE 3. The enzymatic activity of 3CLP-YHis in vitro. A, spectra of FRET 
monitoring of enzymatic activity of truncated fusion protein 3CLP-YHis with 
different reaction times. B, samples from different reaction times were loaded 
onto SDS-PAGE. Lanes 1-4 show 0,1,15, and 30 min. The bands from top to 
bottom show His-C-XX-CI45A- Y, a mixture of 3CLP-YHis and cleavage prod¬ 
ucts I C145A-Y, and cleavage product II His-C-XX. 


His was also monomeric (Fig. 1, supplemental Figs. S3 and S4, 
and supplemental Tables S3 and S4). 

To eliminate the possible effect of the YFP protein in 3CFP- 
Y-His, we substituted it with MBP protein to construct 3CFP- 
MBP-His, which showed almost the same activity as 3CFP-Y- 
His. We concluded that after the first step of autocleavage, the 
product containing a free N terminus and one protein attached 
to the C terminus remained monomeric in solution and with 
higher enzyme activity than the polyprotein with proteins 
attached to both ends. This enzyme activity was not related to 
the proteins attached to the 3CFP termini. 

Inhibition Efficiency of an Isatin Inhibitor on His-Y- 
XX(Q/E)-3CLP-Y— The isatin derivative 5f is a non-covalent 
SARS coronavirus 3C-like protease inhibitor, with an IC 50 of 
0.37 /x m. We used 5f as a probe to test whether the substrate 
binding pocket conformation of 3CF pro in the polyprotein 
remains the same as that in the free 3CF pro . 5f can almost fully 
inhibit the enzymatic activity of His-Y-XX(Q/E)-3CFP-Y at 50 
/x m (supplemental Fig. S5 and supplemental Table S5), implying 
that the substrate binding pocket conformation of 3CF pro in the 
polyprotein is similar to that in the free 3CF pro . 

Aggregation State of the Polyproteins with the Presence of Sub¬ 
strate or Substrate Analog —The experimental results above 
indicated that the polyproteins were monomeric in solution as 
free proteins. However, we still do not know whether they can 
form transient complexes in solution and what happens when 
they react with substrate. We did chemical cross-linking exper- 
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iments for the two polyproteins without and with the presence 
of a substrate analog, an isatin derivative 5f. Similar to the sub¬ 
strate, 5f was shown to be able to enhance dimer formation for 
SARS-3CL pro in our previous study (24). From the cross-linking 
experiment, we can see that more dimers were formed for the 
wild type protein, the N-terminal cleaved protein, and the full- 
length large protein (Fig. 4). In fact, for the full-length large 
protein, the dimer band showed very weak, if any, formation, 
with EGS cross-linking in the absence of 5f, which turned out to 
be more obvious after cross-linking in the presence of 5f. 


A B C 



FIGURE 4. SDS-PAGE of chemical cross-linking by EGS. A, wild type SARS- 
3CL pro (5.12 mg/ml). B, 3CLP-Y-His (6 mg/ml). C, His-C-XX(Q/E)-3CLP-Y (6 
mg/ml). For each PAGE, from left to right, lane 1 is the enzyme only, lane 2 is 
enzyme with EGS, lane 3 is enzyme with equal molar substrate analog 5f, and 
lane 4 is enzyme with equal molar substrate analog 5f and EGS. 



Sedimentation Coefficient (S) 

FIGURE 5. Sedimentation coefficient distribution c(s) profile of His-C- 
XX(Q/E)-3CLP-Y' without and with the presence of isatin derivative 5f at 
40,000 rpm and 20 °C. 


Due to the experimental condition requirement for the 
cross-linking study, the protein concentration used is quite 
high. As such high concentration may not reflect the real situ¬ 
ation in vivo , we then analyzed the aggregation state of the 
polyprotein His-C-XX(Q/E)-3CLP-Y at a rational concentra¬ 
tion (1 mg/ml) in the presence of 5f (25) at 10 /xm using AUC. 
From the AUC results (Fig. 5), it is clear that the polyprotein is 
monomeric without 5f and partly forms dimer upon the addi¬ 
tion of 5f. Thus with the presence of substrate, the polyproteins 
can also form dimers. 

Dependence of Enzyme Activity on Enzyme Concentration — 
As shown in our previous studies (4, 24), the enzyme concen¬ 
tration dependence of SARS-3CL pro implies a dimer-only activ¬ 
ity control mechanism. For the polyproteins, we also measured 
their apparent k CSLt /K m for pNA peptide substrate at different 
enzyme concentrations. The (k C3Lt /K m ) app increases along with 
the enzyme concentration increase in a similar way as the 
matured enzyme (Fig. 6). We also tested the concentration-de¬ 
pendent enzyme activity using the polyprotein His-C-XX- 
C145A-Y as substrate and observed a similar trend (data not 
shown). Together with the AUC analysis, we conclude that 
dimerization is also required for the polyprotein, although the 
dimer may only form transiently, which can be stabilized by 
substrate. We fitted the enzyme activity data using Equation 6 
to derive (k CSLt /K m ) dimer and association constant K a (Table 1). 

From Table 1, we can see that among the three proteins con¬ 
taining 3CL pro , the intact polyprotein has the lowest ability to 
form active dimer, with a dimer (substrate-induced) dissocia¬ 
tion constant of about 100 /xm, whereas the N-terminal cleaved 
polyprotein has a somewhat stronger dimer formation ability 
(K d around 42 /xm) and the free protein has the strongest dimer 
formation ability (K d around 1.6 /xm). The catalytic activities of 
the dimers formed by the three proteins also follow the same 
order. As the apparent enzyme activity positively depends on 
the dimer activity and the dimer association constant, the intact 
polyprotein has quite low enzyme activity, and upon the first 
step of digestion, the activity of the truncated 3CLP-Y-His will 
be increased, and the matured 3CL pro will have the highest 
enzyme activity. Although different substrates may give differ¬ 
ent quantitative results, the activities should follow the same 
order. 


(k IK ) of 3CL p, ° 

' cat nr app 

B 

{k JK ) of 3CLP-Y-His 

' cat nr app 

c 

(k IK) of His-C-XX(Q/E)-3CLP-Y 

' cat nr app ' ' 



FIGURE 6. Enzyme activity of SARS-3CL pro , 3CLP-YFP-His, and His-C-XX(Q/E)-3CLP-Y at different concentrations. The apparent velocity second-order 
constant {k cat /K m ) app increases in the same manner as matured SARS-3CL pro as the enzyme concentration (Cone E) increases, indicating that the dimer form is 
the active form of the proteinase. ■, measured value of {k cat /K m ) app ; the solid line is the fitted curve. 
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TABLE 1 

Relative enzyme activities toward the pNA peptide substrate 

The proteolysis activity of all the enzymes at different concentrations were 
determined by incubating 0.2 mM pNA peptide with the enzymes in buffer A (40 
mM phosphate-buffered saline, pH 7.3, 100 mM NaCl, 1 mM EDTA) with 5 mM 
dithiothreitol. 


Enzyme 

(k IK ) a 

'•'cat' 'm/dimer 

Kf 

K d b 


mM~ 2 • min~ 1 

1 

I 

fXM 

His-C-XV(Q/E)-3CLP-T 

3.6 ± 0.4 

0.010 ± 0.002 

100 

3CLP-T-His 

6.4 ± 0.6 

0.024 ± 0.004 

42 

3CL pro 

31.8 ± 1.2 

0.61 ± 0.08 

1.6 


a Fitted and determined by Equation 6. 
b K d was derived from K a . 


DISCUSSION 

Substrate-induced Dimerization Is Essentialfor the Activity of 
SARS-3CL pro in the Polyprotein—In vitro studies showed that 
dimerization is essential for the enzyme activity of SARS- 
3CL pro (4). However, whether dimerization is necessary for the 
activity of its precursor, the polyprotein, is still not clear. Hsu 
et al. (22) reported that the partially cleaved polyprotein can 
form a small amount of active dimer and proposed that during 
the autocleavage of the 3CL pro , the enzyme is dimeric at each 
step. However, in their study, only 10 residues were attached to 
both the termini of SARS-3CL pro , which may not be long 
enough to mimic the situation in the polyprotein as the hydro¬ 
phobic regions 1 and 2 (HD1 and HD 2) attached to the N and C 
terminus of 3CL pro are more than 10 amino acids in the repli- 
case of coronavirus (26). 

In the current study, we used full-length folded proteins as an 
attachment to the two ends of 3CL pro to mimic the case in the 
replicase. Our results showed that 3CL pro in the polyprotein 
was monomeric in solution when standing alone, and the activ¬ 
ity was not related to the proteins attached as the polyprotein 
kept its activity when substituting YFP with MBP. Furthermore, 
we also discovered that the N-terminal cleaved product of the 
polyprotein, which occurs when only one protein is attached to 
the C terminus of 3CL pro , was still active. These active polypro¬ 
teins can also be deactivated by the previously reported 3CL pro 
inhibitor 5f, implying that the active site conformation in the 
polyprotein may be similar to that in the active protomer of the 
3CL pro dimer. 

Substrate-enhanced dimerization was observed for the 
matured SARS-3CL pro (24). The dissociation constant of pure 
SARS-3CL pro was measured to be 14 /xm using sedimentation 
equilibrium. However, for most in vitro assays for SARS- 
3CL pro , the enzyme concentration was around 1 /xm. As our 
previous study (4) and various other studies (19) have shown, 
only dimer is active for this enzyme. This brought a dilemma as 
the dimer concentration must be very low under the assay con¬ 
dition assuming a K d of 14 /xm. However, when substrate was 
present, the dimerization ability of SARS-3CL pro was signifi¬ 
cantly increased with a K d around 1 /xm under the experimental 
conditions (24). 

For the case of the polyproteins, we also tested their enzyme 
activity change along with the increase of enzyme concentra¬ 
tion. Similar to SARS-3CL pro , the apparent second-order con¬ 
stant, (k CSLt /K m ) a PP > increases with the enzyme concentration 
(Fig. 6). Thus we postulated that the polyprotein also needs to 
dimerize first to perform its enzymatic function. 


To verify this assumption, we also did EGS cross-linking and 
AUC analysis for the polyproteins with a substrate analog com¬ 
pound 5f. we found that significant dimer can be detected when 
both experiments were done in the presence of 5f. Thus we 
concluded that substrate-induced dimerization is essential for 
the activity of SARS-3CL pro in the polyprotein. 

Possible Mechanism for the Cleavage Reaction of SARS- 
3CL pro in the Polyprotein —Although few autolytic studies have 
been reported for SARS 3CL pro , 3CL pro from other coronavi- 
ruses were studied in more detail. It was reported that the auto¬ 
processing of 3CL pro was related to membrane (27-29), 
although the details of dynamic control during this process are 
still not clear. Our results indicate that although the polypro¬ 
tein remains a monomer as free protein, it can form transient 
dimers upon substrate binding to perform their catalytic 
activity. 

Hsu et al (22) proposed a possible mechanism for the matu¬ 
ration process of SARS-3CL pro from their crystal structure 
study of the C145A mutant. As the active site of one protomer 
in the C145A dimer binds with the C-terminal 6 amino acids of 
the protomer from another asymmetric unit, they believe that 
this kind of structure mimics the product-bound form in the 
maturation process. They also attached 10 extra residues at the 
N terminus and/or the C terminus of the protein and found that 
these proteins also form dimers, although weaker when com¬ 
pared with the matured enzyme. In their mechanism model, for 
the first and second step, two monomers of the polyprotein first 
cleave each other to cut off the N-terminal extra peptides and 
then form a dimer. That is, the first step of the reaction occurs 
between the two monomers within one dimer. 

However, our data showed that the polyproteins with extra 
proteins at both ends can still form dimeric structure under 
substrate induction condition, which prompts for an intermol¬ 
ecule reaction mechanism. For the case of HIV protease matu¬ 
ration, during the first step of digestion, the full-length HIV 
polyprotein also forms dimer by prerequisite and cuts the 
N-flanking site intramolecularly as the initial rates related to 
N-terminal autocleavage linearly increased along with the 
increase of the polyprotein concentration (30). In contrast, our 
results indicated that the initial rates for the SARS polyprotein 
related to the N-terminal or C-terminal autocleavage non-lin- 
early increased along with the increase of the polyprotein con¬ 
centration. That is, the SARS polyprotein may cut its termini 
intermolecularly. In fact, this is also in accordance with the 
crystal structure of Hsu et al (22) as they observed that a dimer 
molecule of the C145A mutant binds with the C-terminal 6 
amino acids of the protomer from another asymmetric unit. 

Based on the above analysis, we proposed an updated version 
for the SARS-3CL pro maturation mechanism (Fig. 7). Shortly 
after the polyprotein translation, a tiny amount of transient 
dimers can form, which become more stabilized by binding to 
its substrate (another polyprotein), and then an intermolecular 
reaction occurs to release the free N terminus of SARS-3CL pro 
in the polyprotein. The one-step product, with a free N termi¬ 
nus but a restricted C terminus of 3CL pro , can be induced into 
the dimer form more easily by the substrate and is more active 
than its precursor, which will make it act as the main enzyme to 
cut the N-terminal flanking site of the other molecules in the 
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Fourth step Third step 

FIGURE 7. Possible mechanism of 3CL pro autoprocessing. In the first step, 
polyproteins A and B form a transient dimer, which cuts the N-terminal flank¬ 
ing site of molecule C (and D, etc.). In the second step, molecule C and D with 
a free N terminus but a restricted C terminus of 3CL pro form a dimer and act as 
the main enzyme to cut molecule A or B through trans-cleavage. For the third 
step, the truncated polyprotein with free N terminus digested its C-terminal 
flanking site to release the free C terminus of 3CL pro monomer. For the fourth 
step, monomers of the matured enzyme assembled into an active dimer of 
3CL pro . 

polyprotein in step two. In the third step, the one-step product 
digests its C-terminal flanking site to release the free C termi¬ 
nus of 3CL pro monomer. In the fourth step, along with the accu¬ 
mulation of 3CL pro monomers, active dimers were assembled 
by two protomers to form matured 3CL pro . 

As the substrate-induced dimerization occurs at each step 
when compared with the conventional “power of two” mecha¬ 
nism (10), this “substrate-dependent power of two” mechanism 
provides an additional way of enzyme activity regulation. For 
example, at the very beginning of viral replication, only a very 
low concentration of the enzyme and the substrate is present, 
so nothing will happen. Along with the polyprotein synthesis, 
substrates are accumulated, which then induce more enzyme 
dimer formation and activate the process of the enzyme diges¬ 
tion. Then along with the digestion reaction, the concentration 
of substrate will be lowered, whereas the matured enzyme con¬ 
centration will be increased. As the matured enzyme has a 
stronger dimer association constant, its increased concentra¬ 
tion results in more active dimers to treat the low concentration 
of substrate until most substrates are consumed. This kind of 
enzyme activity regulation is a highly efficient and economic 
way of activity control and substance usage. We expect that in 
addition to other 3CL pro proteinases, more examples of mul¬ 
tidomain proteins may also follow this type of dynamic control 
mechanism. 
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